Efficient classical simulation of the semi-classical Quantum Fourier Transform 
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A number of elegant approaches have been developed for the identification of quantum circuits which can be 
efficiently simulated on a classical computer. Recently, these methods have been employed to demonstrate the 
classical simulability of the quantum Fourier transform (QFT). In this note, we show that one can demonstrate 
a number of simulability results for QFT circuits in a straightforward manner using Griffiths and Niu's semi- 
classical QFT construction [Phys. Rev. Lett. 76, 3228 (1996)]. We then discuss the consequences of these 
results in the context of Shor's factorisation algorithm. 
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An important part of understanding the power of quantum 
computation relative to its classical counterpart is the identifi- 
cation of those circumstances where both classical and quan- 
tum computation have equal power. One important aspect of 
this is to identify quantum circuits which can be efficiently 
simulated on a classical computer, particularly when these cir- 
cuits play a role in quantum algorithms believed to have an 
exponential advantage. 

The standard description of a quantum computation con- 
sists of the following steps. A register of quantum bits (qubits) 
is initialised in a certain state; Then a sequence of unitary 
quantum gates (the quantum circuit) are applied. Finally, each 
of the qubits is measured. The output of a quantum compu- 
tation is thus a classical bit string. This output is not usu- 
ally determinate - rather bitstrings are returned by a quan- 
tum computation according to a particular probability distri- 
bution. We will say that a device which outputs bitstrings dis- 
tributed according to this same probability distribution is sim- 
ulating the quantum computation. Of particular interest are 
simulations on classical computers which are efficient. This 
means that the resources needed to run the simulation (run- 
time, memory, etc.) scale polynomially (or at the very least 
sub-exponentially) with the size of the problem, usually the 
number of qubits taking part in the quantum circuit. We shall 
call a family of quantum circuits acting on a given family of 
input states classically simulable when an efficient method is 
known for its simulation on a classical computer. 

Several classes of quantum circuit are known to be clas- 
sically simulable when an appropriate simulation strategy is 
adopted. For example, classically simulable circuits include 
those consisting of Clifford group gates acting on input qubits 
in state |0)I11, circuits with restricted topological and depth 
properties 13, B 01 acting on product state inputs and circuits 
where the entanglement is limited at every stage 

These simulation strategies share a common feature - the 
identification of a suitable classical data structure to efficiently 
store the state or circuit, with efficient rules to update and eval- 
uate it. In the state vector formalism, where states are repre- 
sented as vectors of complex amplitudes and unitary operators 
as square matrices, families of circuits over increasing num- 
bers of qubits cannot generally be efficiently simulated, since 
the size of vectors and matrices increases exponentially. Nev- 
ertheless, Jozsa and Linden showed (5] that certain classes of 



quantum circuit remain simulable in this framework, provided 
the entanglement in the states is restricted in a way that we 
shall describe below. 

Recently, other data structures have been proposed. These 
include the stabilizer formalism \ \}, the matrix product state 
description [6, 7] and the weighted graph state approach, 
each giving an efficient representation of multi-qubit states, 
and efficient tensor network descriptions [2, 3, 4] of quantum 
circuits. Each of these approaches has lead to new examples 
of quantum circuits which are classically simulable. 

Very recently, the classical simulability of the quantum 
Fourier transform has been investigated jgi Tloll . The quantum 
Fourier transform (QFT) |TT] (over the field Z2„) is an im- 
portant family of quantum operations. It plays a special role 
in quantum computation theory, forming a key part of Shor's 
factoring algorithm ifTHl . The n-qubit QFT coherently trans- 
forms an input state \x) in the computational basis as follows; 

|x)^^^e-^-^|y) (1) 

where |a;) represents a qubit-string \xi . . . a;„). 

A family of quantum circuits which efficiently realise this 
operation 1 12] are illustrated in figure [T] Other circuit imple- 
mentations of the QFT have been proposed. Coppersmith il3ll 
noted that the size of the controlled phase-rotations in figure 
[Uget progressively smaller to the point where their effect on 
the final state is insignificant. By neglecting these small ro- 
tations, one can produce an approximate QFT circuit with an 
output which has a high-fidelity compared to that of the exact 
circuit, but with a reduced number of gates. Cleve and Wa- 
trous 1 14] proposed an approximate QFT circuit which can be 
parallelized to depth logarithmic in n. 

We shall call a unitary terminating if it occurs at the end a 
circuit, immediately preceding the (computational basis) mea- 
surement of all qubits. In Shor's algorithm, the QFT is termi- 
nating. Griffiths and Niu ifTsIl observed that when a controlled 
unitary is immediately followed by a measurement on the con- 
trol qubit, a coherent two-qubit implementation of the gate is 
not necessary. Instead, one can measure the control qubit be- 
fore the target qubit, and use the measurement result as a clas- 
sical control to determine whether to apply the single-qubit 
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FIG. 1: A schematic of the family of quantum circuits which implement the Ai'-qubit QFT, as independently proposed by Deutsch and 
Coppersmith fill [T2I1 . The controlled gates are controlled rotations R„ = exp[— i27ro-z2~'""^^')]. Note that this circuit also reverses the 
bit-order. 
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FIG. 2: Griffiths and Niu's "semi-classical" circuit for the terminating QFT. The dashed lines represent the flow of classical control data. 



gate. This can be expressed via the following circuit identity, 
where the flow of a classical bit is represented by a dashed 
line: 
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(2) 

With this identity one can rewrite the circuit for a termi- 
nating QFT so that it consists entirely of measurements and 
adaptive single-qubit operations. This is illustrated in figure|2l 

Employing a tensor network description |9, 10], two groups 
of authors have recently presented independent proofs of the 
classical simulability of the approximate QFT in a variety of 
contexts. In |9] a classical simulation method for the n-qubit 
Cleve and Watrous QFT circuit with sub-exponential n'^^'°s 
resource scaling. In |10] it was shown that any family of cir- 
cuits with a constant number of Coppersmith's QFT circuit 
and additional logarithmic depth limited range circuits can be 
simulated with classical resources polynomial in n. In both 
^ and iflOil the input state is assumed to be a product state. 

In this note, we shall show that adopting Niu and Griffiths' 
QFT circuit allows a very simple analysis of the simulabil- 
ity properties of the terminating quantum Fourier transform. 
We will show that a terminating QFT acting on product state 
inputs is classically simulable with a simple argument which 
takes just a few lines, in contrast to the elegant but somewhat 
complicated tensor methods employed in @] and ifioll . Fur- 
thermore, by combining this approach with prior results we 
are able to extend the argument to cover a much wider class 
of non-product input states. 

Classical simulability of terminating QFT with product 
state inputs. Niu and Griffiths' circuit for the terminating 



QFT consists of a series of adaptive single-qubit unitaries and 
measurements. Since there are no entangling gates, the circuit 
can be thus simulated by following the evolution of individual 
qubits one by one. For each qubit this consists of a sequence 
of (adaptive) unitary gates followed by a measurement. Both 
the update of a single qubit state under the action of a sin- 
gle qubit gates and calculating the probabilities of the two 
measurement outcomes require a small (constant) number of 
classical calculation steps. By sampling from this probability 
distribution and using the generated classical bit as the con- 
trol for the next rounds of single-qubit gates a simulation of 
the complete circuit is achieved. The full classical simulation 
consists of a linear number of simulated single qubit measure- 
ments and a quadratic number of single-qubit gates and is thus 
efficient. 

Classical simulability of terminating QFT with entangled 
inputs. Jozsa and Linden showed |5] that entangled states 
across arbitrary numbers of qubits may be efficiently repre- 
sented provided the state is p-blocked - in other words, for 
some fixed p the state can be written as a product of pure 
entangled states across subsets consisting of no more than p 
qubits. Furthermore, they showed that any family of circuits is 
classically simulable, if the entanglement remains p-blocked 
at all stages of the computation. 

Let us consider terminating QFT circuits with p-blocked 
input states. Since Niu and Griffiths' circuit contains no op- 
erations which can increase the entanglement of the state, the 
arguments of fl] tell us that the circuit is classically simulable. 

We can also consider other kinds of entangled input states. 
There exist classes of states which do not not fulfill Jozsa and 
Linden's p-blocked criterion, but which still have an efficient 
classical representation. These include matrix product states 
(MPS) and graph states, which we shall consider in turn. 
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Matrix product states are states which can be written in the 
following manner 0, 01 • 

1 

IV'>= E (3) 

where the coefficients Ci^...i^ can be represented in matrix 
product form 

Q,...,„ = ^y[m[''^ m!^'^^ ■ ■ ■ Aff (4) 

where '\ Ma^*^^ • • • , M^*"^ are a set of matrices. The 
dimensions of each matrix is Di x Di+i. The value Di rep- 
resents the Schmidt number |@1 over the partition of the state 
between qubit [i — 1) and (i). We label the maximal Schmidt 
number over all such partitions as x- The total number of pa- 
rameters describing the state scales as 0(nx^)- If a family of 
states over increasing number of qubits n is bounded in x then 
it can be efficiently represented in this framework. 

Let us consider the action of the n-qubit terminating quan- 
tum Fourier transform on a set of input states with a matrix 
product state description with bounded x- Vidal |0] and Yoran 
and Short |3] showed that to update a matrix product state 
description under the action of a single qubit unitary gate re- 
quires O(x^) elementary operations |6] and does not increase 
X- In addition, to update the MPS description after a compu- 
tational basis measurement on one on the qubits [3] requires 
0(npoly(x)) operations. Referring back to Niu and Griffiths' 
circuit we thus see that the action of the terminating QFT on 
a family of MPS input states with bounded x is classically 
simulable. 

Graph states [|16|1 are a generalisation of the cluster states 
introduced by Briegel and Raussendorf fv^. Each state is 
associated with a graph consisting of vertices and edges con- 
necting vertex pairs. Every vertex on the graph is associated 
with a qubit prepared in state |+) = (l/v^)(|0) + |1)). The 
graph state associated with the graph is generated by appli- 
cation of controlled-CTz gates between all each pair of qubits 
whose vertices are connected on the graph. 

Single-qubit measurements on certain classes of graph 
states are universal for quantum computation lUsil . However 
single qubit measurements on families of graph states with 
certain topologies e.g. for graphs which are one-dimensional 
|l9] or square lattice graph states with logarithmic width 
|j3i |2QJ the evolution of the state under single-qubit measure- 
ments can be efficiently classically simulated. Since the ter- 
minating QFT can itself be implemented with adaptive single- 
qubit measurements, the same reasoning applies and with all 
such classes of graph states as input, the terminating QFT will 
be classically simulable. 

Discussion. Since the terminating QFT is shown to be clas- 
sically simulable for such a wide range of input states, one 
may ask whether this can be exploited for the efficient simula- 
tion of Shor's algorithm. In Shor's algorithm, states generated 
from a modular exponentiation circuit (applied to a uniform 
superposition of computational basis states) are fed into a ter- 
minating QFT. 



The results above tell us that if the quantum states returned 
by the modular exponentiation belonged to any of the classes 
of states considered above, then in that case, the factorisation 
algorithm would be classically simulable. In |5] Jozsa and 
Linden consider the possibility of such states being p-blocked. 
They find that the modular exponentiation does, indeed, typ- 
ically generate non-p-blocked states, i.e. with entanglement 
ranging over an unbounded number of qubits. It is interesting 
to note that their argument does not prove that there are not 
special cases where p-blocked states do arise. 

It is worth discussing what implications these results have 
for our understanding of where the non-classical power of 
Shor's algorithm lies. These results throw into relief the im- 
portance of the entanglement properties of the states generated 
by the first stage of the algorithm - the modular exponentia- 
tion. One should not, however, rush to consider modular ex- 
ponentiation alone to be the more important stage of the algo- 
rithm. This would be overhasty, not the least because modular 
exponentiation is also, when terminating, a classically sim- 
ulable operation. This is easy to see, by recalling that the 
input state to the modular exponentiation is a uniform super- 
position of all n-bit string states. The output of the modular 
exponentiation is the state \ f{x)). A computational basis 
measurement on the output register will reveal one of the bit 
strings f{x), which will occur with probability 2^". Thus an 
efficient classical simulation strategy is immediately apparent 
- one chooses a particular classical input string x according 
to the probability distribution of the input state (in this case 
a uniform distribution), and then one calculates f{x) classi- 
cally. This emphasises that it is the coherent interface between 
the modular exponentiation and the QFT which allows Shor's 
algorithm to exhibit its (believed) speed-up. 

Conclusion. We have shown that Griffiths and Niu's semi- 
classical construction of the terminating quantum Fourier 
transform gives a simple way to understand why the QFT, 
when considered in isolation, is classically simulable. Lever- 
aging results from the literature, we have used the same ap- 
proach to show that for a wide variety of classes of input state 
(with efficient classical description) this simulabilty property 
remains. The methods here could be applied to any terminat- 
ing circuit which can be implemented via single-qubit mea- 
surement and adaptive single qubit unitaries. These results 
illustrate the special status of operations occurring in the ter- 
minating position in a quantum circuit for questions of sim- 
ulability and highlight that component circuits that are indi- 
vidually classically simulable can still be combined to form 
circuits with an apparent quantum speed up. We hope that 
they also help shed a little more light on the fascinating ques- 
tion of from where quantum computation does and does not 
get its power. 
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